Database Applications and Web-Enabled Databases
University of California, Berkeley School of Information IS 257: Database Management
IS 257 – Fall 2015 2015.10.01 - SLIDE 1
Announcements
IS 257 – Fall 2015 2015.10.01 - SLIDE 2
Lecture Outline
• Review – Database design review – Introduction to SQL and MySQL • Application Development in Access • Databases for Web Applications – Overview
IS 257 – Fall 2015 2015.10.01 - SLIDE 3
Lecture Outline
• Review – Database design review – Introduction to SQL & MySQL • Application Development in Access • Databases for Web Applications – Overview
IS 257 – Fall 2015 2015.10.01 - SLIDE 4
Database Design Process
Application 1 Application 2 Application 3 Application 4 External External External External Model Model Model Model Application 1 Conceptual requirements
Application 2 Conceptual requirements Internal Conceptual Logical Model Application 3 Model Model Conceptual requirements
Application 4 Conceptual requirements
IS 257 – Fall 2015 2015.10.01 - SLIDE 5
Cookie ER Diagram
pubid accno
BIBFILE CALLFILE LIBFILE
accno accno libid AU_BIB AU ID libid PUBFILE pubid
INDXFILE SUBFILE Note: diagram contains only AUTHORS attributes used accno subcode subcode for linking AU_ID Author
IS 257 – Fall 2015 2015.10.01 - SLIDE 6
Logical Model: Mapping to Relations
• Take each entity – Authors – BIBFILE – LIBFILE – CALLFILE – SUBFILE – PUBFILE – INDXFILE – AU_BIB • And make it a table...
IS 257 – Fall 2015 2015.10.01 - SLIDE 7
Physical Model: SQL for Creation
• We looked at how an SQL “script” could be created that would create each of the relational tables, define primary keys and indexes and load data into the database
IS 257 – Fall 2015 2015.10.01 - SLIDE 8
MySQL Data Types
• MySQL supports all of the standard SQL numeric data types. These types include the exact numeric data types (INTEGER, SMALLINT, DECIMAL, and NUMERIC), as well as the approximate numeric data types (FLOAT, REAL, and DOUBLE PRECISION). The keyword INT is a synonym for INTEGER, and the keyword DEC is a synonym for DECIMAL • Numeric (can also be declared as UNSIGNED) – BIT(n) (variable field of n bits) – BOOL or BOOLEAN (internally is TINYINT with value of 0 for FALSE) – TINYINT (1 byte) – SMALLINT (2 bytes) – MEDIUMINT (3 bytes) – INTEGER (4 bytes) – INT (4 bytes - Synonym) – BIGINT (8 bytes) – NUMERIC or DECIMAL (Packed - up to 65 digits - DEC, FIXED synonyms) – FLOAT – DOUBLE (or DOUBLE PRECISION) – SERIAL = BIGINT UNSIGNED NOT NULL AUTO_INCREMENT UNIQUE
IS 257 – Fall 2015 2015.10.01 - SLIDE 9
MySQL Data Types
• The date and time types for representing temporal values are DATETIME, DATE, TIMESTAMP, TIME, and YEAR. Each temporal type has a range of legal values, as well as a “zero” value that is used when you specify an illegal value that MySQL cannot represent – DATETIME '0000-00-00 00:00:00' – DATE '0000-00-00' – TIMESTAMP (4.1 and up) '0000-00-00 00:00:00' – TIMESTAMP (before 4.1) 00000000000000 – TIME '00:00:00' – YEAR 0000
IS 257 – Fall 2015 2015.10.01 - SLIDE 10
MySQL Data Types
• The string types are CHAR, VARCHAR, BINARY, VARBINARY, BLOB, TEXT, ENUM, and SET • Maximum length for CHAR is 255 and VARCHAR is 65,535 (limited by row size) Value CHAR(4) Storage VARCHAR(4) Storage "" " " 4 "" 1 "ab" "ab " 4 "ab" 3 "abcd" "abcd" 4 "abcd" 5 "abcdefg" "abcd" 4 "abcd" 5 • For longer things there is BLOB and TEXT
IS 257 – Fall 2015 2015.10.01 - SLIDE 11
MySQL Data Types
• A BLOB is a binary large object that can hold a variable amount of data. • The four BLOB types are TINYBLOB, BLOB, MEDIUMBLOB, and LONGBLOB. These differ only in the maximum length of the values they can hold • The four TEXT types are TINYTEXT, TEXT, MEDIUMTEXT, and LONGTEXT. These correspond to the four BLOB types and have the same maximum lengths and storage requirements • TINY=1byte, BLOB and TEXT=2bytes, MEDIUM=3bytes, LONG=4bytes
IS 257 – Fall 2015 2015.10.01 - SLIDE 12
MySQL Data Types
• BINARY and VARBINARY are like CHAR and VARCHAR but are intended for binary data of 255 bytes or less • ENUM is a list of values that are stored as their addresses in the list – For example, a column specified as ENUM('one', 'two', 'three') can have any of the values shown here. The index of each value is also shown: • Value = Index • NULL = NULL • ‘’ = 0 • 'one’ = 1 • ‘two’ = 2 • ‘three’ = 3 – An enumeration can have a maximum of 65,535 elements.
IS 257 – Fall 2015 2015.10.01 - SLIDE 13
MySQL Data Types
• The final string type (for this version) is a SET • A SET is a string object that can have zero or more values, each of which must be chosen from a list of allowed values specified when the table is created. • SET column values that consist of multiple set members are specified with members separated by commas (‘,’) • For example, a column specified as SET('one', 'two') NOT NULL can have any of these values: – '' – 'one' – 'two' – 'one,two‘ • A set can have up to 64 member values and is stored as an 8byte number
IS 257 – Fall 2015 2015.10.01 - SLIDE 14
ALTER Table
• ALTER TABLE table-name ADD COLUMN col_name col_definition; • … DROP COLUMN col_name; • … CHANGE col_name new_col_definition; • Adds/removes a new column from an existing database table • Many other options for adding constraints (like NOT NULL, or PRIMARY KEY), etc.
IS 257 – Fall 2015 2015.10.01 - SLIDE 15
INSERT
• INSERT INTO table-name (attr1, attr4, attr5,…, attrK) VALUES (“val1”, val4, val5,…, “valK”); • Adds a new row(s) to a table. • INSERT INTO table-name (attr1, attr4, attr5,…, attrK) VALUES SELECT ...
IS 257 – Fall 2015 2015.10.01 - SLIDE 16
Creating a new table data from existing tables
• Syntax: – INSERT INTO tablename (attr1, attr2, attr3) SELECT [DISTINCT] xattr1, xattr2, xattr3 FROM rel1 r1, rel2 r2,… rel3 r3 WHERE condition1 {AND | OR} condition2 ORDER BY attr1 [DESC], attr3 [DESC]
tablename has to previously exist for this to work in MySQL…
IS 257 – Fall 2015 2015.10.01 - SLIDE 17
DELETE
• DELETE FROM table-name WHERE
IS 257 – Fall 2015 2015.10.01 - SLIDE 18
UPDATE
• UPDATE tablename SET attr1=newval, attr2 = newval2 WHERE
IS 257 – Fall 2015 2015.10.01 - SLIDE 19
DROP Table
• DROP TABLE tablename; • Removes a table from the database.
IS 257 – Fall 2015 2015.10.01 - SLIDE 20
CREATE INDEX
• CREATE [ UNIQUE|FULLTEXT|SPATIAL ] INDEX indexname indextype ON tablename (attr1 [ASC|DESC][, attr2 [ASC| DESC], ...]) [USING [BTREE|HASH| RTREE]]
IS 257 – Fall 2015 2015.10.01 - SLIDE 21
Lecture Outline
• Review – Introduction to SQL and MySQL • Application Development in Access • Databases for Web Applications – Overview
IS 257 – Fall 2015 2015.10.01 - SLIDE 22
Database Applications
• Generally, end-users of database data probably do not want to learn SQL in order to access the information in the database • Instead, they would prefer to use a familiar PC or Web interface that uses the graphical conventions and behaviors that they are familiar with • Today we will look briefly at PC –style client applications using systems like Access and Web-based systems
IS 257 – Fall 2015 2015.10.01 - SLIDE 23
Access Usability Hierarchy
API
VBA
MACROS
Functions/Expressions Objects – Tables, queries Forms, Reports From McFadden Chap. 10
IS 257 – Fall 2015 2015.10.01 - SLIDE 24
Examples
• Access OBJECT level – QBE querying • Building Application interfaces – User wants “point and click” and forms to fill in, not a Query editing screen or wizard – How to build them • Drag and drop as in Access • Programming Languages • 4th Generation languages (more on these later)
IS 257 – Fall 2015 2015.10.01 - SLIDE 25
Query-by-Example
• QBE was developed in the 1970s as a simpler to use interface for IBM mainframe databases • In QBE the user puts parts of what they want to get from the database into a form similar to what the output will look like • The Query Design View in Access is an example of QBE
IS 257 – Fall 2015 2015.10.01 - SLIDE 26
Access Query Interface…
! What sites might Lorraine Vega dive on her trip? – SQL generated… SELECT DIVECUST.Name, DEST.[Destination Name], SITES.[Site Name] FROM ((DIVECUST INNER JOIN DIVEORDS ON DIVECUST.[Customer No] = DIVEORDS.[Customer No]) INNER JOIN DEST ON DIVEORDS.Destination = DEST.[Destination Name]) INNER JOIN SITES ON DEST.[Destination No] = SITES.[Destination No] WHERE (((DIVECUST.Name) Like "*Vega"));
IS 257 – Fall 2015 2015.10.01 - SLIDE 27
Access Query Interface
• Output is generated in a window…
IS 257 – Fall 2015 2015.10.01 - SLIDE 28
The MS JET Database Engine
Database app Database app
Visual Basic Access Excel Word
Visual Basic for Applications (VBA) Host Languages for the Jet DBMS
Data Access Objects (DAO) Includes DDL and DML Jet Query Internal Replication Engine ISAM Engine Jet Database Engine (Jet DBMS)
Adapted from Roman, Database “Access Database Design and Programming”
IS 257 – Fall 2015 2015.10.01 - SLIDE 29
Using Access for Applications
• Forms • Reports • Macros • VBA programming • Application framework • HTML Pages
IS 257 – Fall 2015 2015.10.01 - SLIDE 30
Access Applications
IS 257 – Fall 2015 2015.10.01 - SLIDE 31
Access Forms
IS 257 – Fall 2015 2015.10.01 - SLIDE 32
Access Forms
IS 257 – Fall 2015 2015.10.01 - SLIDE 33
Forms – including query results
IS 257 – Fall 2015 2015.10.01 - SLIDE 34
Form Layout and Design
IS 257 – Fall 2015 2015.10.01 - SLIDE 35
Reports
IS 257 – Fall 2015 2015.10.01 - SLIDE 36
Report Design
IS 257 – Fall 2015 2015.10.01 - SLIDE 37
Reports – design and result
IS 257 – Fall 2015 2015.10.01 - SLIDE 38
Access Relationships
IS 257 – Fall 2015 2015.10.01 - SLIDE 39
Lecture Outline
• Review – Introduction to SQL • Application Development in Access • Databases for Web Applications – Overview
IS 257 – Fall 2015 2015.10.01 - SLIDE 40
Overview
• Why use a database system for Web design and e-commerce? • What systems are available? • Pros and Cons of different web database systems? • Text retrieval in database systems • Search Engines for Intranet and Intrasite searching
IS 257 – Fall 2015 2015.10.01 - SLIDE 41
Why Use a Database System?
• Simple Web sites with only a few pages don’t need much more than static HTML files
IS 257 – Fall 2015 2015.10.01 - SLIDE 42
Simple Web Applications
Web Files Server
Internet Server
Clients
IS 257 – Fall 2015 2015.10.01 - SLIDE 43
Adding Dynamic Content to the Site
• Small sites can often use simple HTML and CGI scripts accessing data files to create dynamic content for small sites.
IS 257 – Fall 2015 2015.10.01 - SLIDE 44
Dynamic Web Applications 1
Web Files Server CGI
Internet
Server
Clients
IS 257 – Fall 2015 2015.10.01 - SLIDE 45
Issues For Scaling Up Web Applications
• Performance • Scalability • Maintenance • Data Integrity • Transaction support
IS 257 – Fall 2015 2015.10.01 - SLIDE 46
Performance Issues
• Problems arise as both the data to be managed and usage of the site grows. – Interpreted CGI scripts are inherently slower than compiled native programs – Starting CGI applications takes time for each connection – Load on the system compounds the problem – Tied to other scalability issues
IS 257 – Fall 2015 2015.10.01 - SLIDE 47
Scalability Issues
• Well-designed database systems will permit the applications to scale to accommodate very large databases – A script that works fine scanning a small data file may become unusable when the file becomes large. – Issues of transaction workload on the site • Starting a separate copy of a CGI program for each user is NOT a scalable solution as the workload grows
IS 257 – Fall 2015 2015.10.01 - SLIDE 48
Maintenance Issues
• Dealing with multiple data files (customer list, product list, customer orders, etc.) using CGI means: – If any data element in one of the files changes, all scripts that access that file must be rewritten – If files are linked, the programs must insure that data in all the files remains synchronized – A large part of maintenance will involve dealing with data integrity issues – Unanticipated requirements may require rewriting scripts
IS 257 – Fall 2015 2015.10.01 - SLIDE 49
Data Integrity Constraint Issues
• These are constraints we wish to impose in order to protect the database from becoming inconsistent. • Five basic types – Required data – attribute domain constraints – entity integrity – referential integrity – enterprise constraints
IS 257 – Fall 2015 2015.10.01 - SLIDE 50
Transaction support
• Concurrency control (ensuring the validity of database updates in a shared multiuser environment).
IS 257 – Fall 2015 2015.10.01 - SLIDE 51
No Concurrency Control: Lost updates
John Marsha • Read account balance (balance = $1000) • Read account balance (balance = $1000)
• Withdraw $200 (balance = $800) • Withdraw $300 (balance = $700) • Write account balance (balance = $800) • Write account balance (balance = $700)
ERROR!
IS 257 – Fall 2015 2015.10.01 - SLIDE 52
Concurrency Control: Locking
• Locking levels – Database – Table – Block or page – Record – Field • Types – Shared (S locks) – Exclusive (X locks)
IS 257 – Fall 2015 2015.10.01 - SLIDE 53
Concurrency Control: Updates with X locking
John Marsha • Lock account balance • Read account balance • Read account balance (balance = $1000) (DENIED) • Withdraw $200 (balance = $800) • Write account balance (balance = $800) • Unlock account balance • Lock account balance • Read account balance (balance = $800) • etc...
IS 257 – Fall 2015 2015.10.01 - SLIDE 54
Concurrency Control: Deadlocks John Marsha • Place S lock • Read account balance • Place S lock (balance = $1000) • Read account balance (balance = $1000) • Request X lock (denied)
• wait ... • Request X lock (denied)
• wait...
Deadlock!
IS 257 – Fall 2015 2015.10.01 - SLIDE 55
Transaction Processing
• Transactions should be ACID: – Atomic – Results of transaction are either all committed or all rolled back – Consistent – Data is transformed from one consistent state to another – Isolated – The results of a transaction are invisible to other transactions – Durable – Once committed the results of a transaction are permanent and survive system or media failures
IS 257 – Fall 2015 2015.10.01 - SLIDE 56
Why Use a Database System?
• Database systems have concentrated on providing solutions for all of these issues for scaling up Web applications – Performance – Scalability – Maintenance – Data Integrity – Transaction support • While systems differ in their support, most offer some support for all of these.
IS 257 – Fall 2015 2015.10.01 - SLIDE 57
Dynamic Web Applications 2
Web Files Server CGI
database Internet DBMS
database Server database
Clients
IS 257 – Fall 2015 2015.10.01 - SLIDE 58
Server Interfaces
HTML SQL Native DHTML DB Web ServerInterfaces Database JavaScript Web DB CGI App ODBC Native DB Web Server interfaces JDBC API’s ColdFusion PhP Perl Web Application
Server Java ASP Adapted from John P Ashenfelter, Choosing a Database for Your Web Site
IS 257 – Fall 2015 2015.10.01 - SLIDE 59
What Database systems are available?
• Choices depend on: – Size (current and projected) of the application – Hardware and OS Platforms to be used in the application – Features required • E.g.: SQL? Upgrade path? Full-text indexing? Attribute size limitations? Locking protocols? Direct Web Server access? Security? – Staff support for DBA, etc. – Programming support (or lack thereof) – Cost/complexity of administration – Budget
IS 257 – Fall 2015 2015.10.01 - SLIDE 60
Desktop Database Systems
System (producer)Platform SQL ODBC Scaling Price Access (Microsoft) Windows Yes Yes SQL Server ~$200 FoxPro (Microsoft) Windows, Mac Yes Yes SQL Server ~$200 FileMaker (FileMaker) Windows, Mac No No FileMaker Server ~$200 Excel (Microsoft) Windows, Mac No Yes Convert to Access~$200 Files (owner) Windows, Mac No No Import into DB ?
• Individuals or very small enterprises can create DBMS-enabled Web applications relatively inexpensively • Some systems will require an application server (such as ColdFusion) to provide the access path between the Web server and the DBMS
IS 257 – Fall 2015 2015.10.01 - SLIDE 61
Pros and Cons of Database Options
• Desktop databases – usually simple to set up and administer – inexpensive – often will not scale to a very large number of users or very large database size – May lack locking management appropriate for multiuser access – Poor handling for full-text search – Well supported by application software (Coldfusion, PHP, etc.)
IS 257 – Fall 2015 2015.10.01 - SLIDE 62
Enterprise Database Systems
System Platform SQL ODBC JDBC Web? SQL-Server (Microsoft) WIndowsNT -2000 Yes Yes ? Yes (IIS) Oracle Internet Platform Unix, Linux, NT Yes Yes Yes Yes Informix Internet Foundation.2000 Unix, Linux, NT Yes Yes Yes Yes Sybase Adaptive Server Unix, Linux, NT Yes Yes Yes Yes DB2 (IBM) IBM,Unix, Linux, NT Yes Yes Yes Yes? • Enterprise servers are powerful and available in many different configurations • They also tend to be VERY expensive • Pricing is usually based on users, or CPU’s
IS 257 – Fall 2015 2015.10.01 - SLIDE 63
Pros and Cons of Database Options
• Enterprise databases – Can be very complex to set up and administer • Oracle, for example recommends RAID-1 with 7x2 disk configuration as a bare minimum, more recommended – Expensive – Will scale to a very large number of users – Will scale to very large databases – Incorporate good transaction control and lock management – Native handling of Text search is poor, but most DBMS have add-on text search options – Support for applications software (ColdFusion, PHP, etc.)
IS 257 – Fall 2015 2015.10.01 - SLIDE 64
Free Database Servers
System Platform SQL ODBC JDBC Web? mSQL Unix, Linux Yes Yes No(?) No? MySQL Unix, Linux, NT Yes Yes No(?) No? PostgreSQL Unix, Linux, NT Yes Yes Yes No?
• System is free, but there is also no help line. • Include many of the features of Enterprise systems, but tend to be lighter weight • Versions may vary in support for different systems • Open Source -- So programmers can add features
IS 257 – Fall 2015 2015.10.01 - SLIDE 65
Pros and Cons of Database Options
• Free databases – Can be complex to set up and administer – Inexpensive (FREE!) – usually will scale to a large number of users – Incorporate good transaction control and lock management – Native handling of Text search has improved, and there are IR-like capabilities in MySQL and PostgreSQL – Support for applications software (ColdFusion, PHP, etc.)
IS 257 – Fall 2015 2015.10.01 - SLIDE 66
Embedded Database Servers
• May require programming experience to install • Tend to be fast and economical in space requirements • Includes many NOSQL databases
IS 257 – Fall 2015 2015.10.01 - SLIDE 67
Pros and Cons of Database Options
• Embedded databases – Must be embedded in a program – Can be incorporated in a scripting language – inexpensive (for non-commercial application) – May not scale to a very large number of users (depends on how it is used) – (May) Incorporate good transaction control and lock management – Text search support is minimal – May not support SQL
IS 257 – Fall 2015 2015.10.01 - SLIDE 68
NOSQL Databases System Platform SQL ODBC JDBC Web? MongoDB Unix, Linux, Win No No No ? REDIS Unix, Linux, Win NO NO No ?
IS 257 – Fall 2015 2015.10.01 - SLIDE 69
Database Security
• Different systems vary in security support: – Views or restricted subschemas – Authorization rules to identify users and the actions they can perform – User-defined procedures (and rule systems) to define additional constraints or limitations in using the database – Encryption to encode sensitive data – Authentication schemes to positively identify a person attempting to gain access to the database
IS 257 – Fall 2015 2015.10.01 - SLIDE 70
Views
• A subset of the database presented to some set of users. – SQL: CREATE VIEW viewname AS SELECT field1, field2, field3,…, FROM table1, table2 WHERE
IS 257 – Fall 2015 2015.10.01 - SLIDE 71
Authorization Rules
• Most current DBMS permit the DBA to define “access permissions” on a table by table basis (at least) using the GRANT and REVOKE SQL commands. • Some systems permit finer grained authorization (most use GRANT and REVOKE on variant views. • Some desktop systems have poor authorization support.
IS 257 – Fall 2015 2015.10.01 - SLIDE 72
Database Backup and Recovery
• Backup • Journaling (audit trail) • Checkpoint facility • Recovery manager
IS 257 – Fall 2015 2015.10.01 - SLIDE 73
Web Application Server Software
• ColdFusion • PHP • ASP • JSP • All of the are server-side scripting languages that embed code in HTML pages
IS 257 – Fall 2015 2015.10.01 - SLIDE 74
Coldfusion
• Coldfusion was one of the first server-side scripting languages and it is still available and used – Originally produced by a company called Allaire, it is now owned by Adobe and is in version 11 – It has always been a commercial product since the mid-1990’s
IS 257 – Fall 2015 2015.10.01 - SLIDE 75
What ColdFusion is Good for
• Putting up databases onto the Web • Handling dynamic databases (Frequent updates, etc) • Making databases searchable and updateable by users • The basic scripting elements are simple, and similar in style to other server-side scripting languages (but the syntax is often different)
IS 257 – Fall 2015 2015.10.01 - SLIDE 76
Coldfusion
• The Coldfusion engine runs in parallel with the web server, and is passed any page in the web server directories that has the appropriate file name extension (.cfm) • The engine processes any Coldfusion script on the web page and passes back an HTML page with the scripts replaced by the script result • As a simple example…
IS 257 – Fall 2015 2015.10.01 - SLIDE 77
Coldfusion Templates
• Assume we have a database named contents_of_my_shopping_cart.mdb -- single table called contents... – With attributes “Item”, “Date_of_item”, “Price” • Create an HTML page (uses extension .cfm), before
... •IS 257 – Fall 2015 2015.10.01 - SLIDE 78
Coldfusion Templates cont.
• … the cfquery goes here… •
•Contents of My Shopping Cart
•• #Date_of_item#
• $#Price#
•
IS 257 – Fall 2015 2015.10.01 - SLIDE 79
Templates cont.
Contents of My Shopping Cart
Bouncy Ball with Psychedelic Markings 12 December 1998 $0.25
Shiny Blue Widget 14 December 1998 $2.53
Large Orange Widget 14 December 1998 $3.75
IS 257 – Fall 2015 2015.10.01 - SLIDE 80
CFIF and CFELSE
IS 257 – Fall 2015 2015.10.01 - SLIDE 81
More Templates
Employee Added
IS 257 – Fall 2015 2015.10.01 - SLIDE 82
CFML ColdFusion Markup Language
• Read data from and update data to databases and tables • Create dynamic data-driven pages • Perform conditional processing • Populate forms with live data • Process form submissions • Generate and retrieve email messages • Perform HTTP and FTP function • Perform credit card verification and authorization • Read and write client-side cookies
IS 257 – Fall 2015 2015.10.01 - SLIDE 83
Next time
• More on Database Applications: PHP and MySQL
IS 257 – Fall 2015 2015.10.01 - SLIDE 84